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Abstract 

Recent studies inspired by results from random matrix theory [Q, ||, ||| found 
that covariance matrices determined from empirical financial time series ap- 
pear to contain such a high amount of noise that their structure can essentially 
be regarded as random. This seems, however, to be in contradiction with the 
fundamental role played by covariance matrices in finance, which constitute 
the pillars of modern investment theory and have also gained industry-wide 
applications in risk management. Our paper is an attempt to resolve this em- 
barrassing paradox. The key observation is that the effect of noise strongly 
depends on the ratio r = n/T, where n is the size of the portfolio and T the 
length of the available time series. On the basis of numerical experiments and 
analytic results for some toy portfolio models we show that for relatively large 
values of r (e.g. 0.6) noise does, indeed, have the pronounced effect suggested 
by |, H | and illustrated later by [|, |] in a portfolio optimization context, 
while for smaller r (around 0.2 or below), the error due to noise drops to ac- 
ceptable levels. Since the length of available time series is for obvious reasons 
limited in any practical application, any bound imposed on the noise-induced 
error translates into a bound on the size of the portfolio. In a related set 
of experiments we find that the effect of noise depends also on whether the 
problem arises in asset allocation or in a risk measurement context: if covari- 
ance matrices are used simply for measuring the risk of portfolios with a fixed 
composition rather than as inputs to optimization, the effect of noise on the 
measured risk may become very small. 
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1 Introduction 



Covariance matrices of financial returns play a crucial role in several branches of finance 
such as investment theory, capital allocation or risk management. For example, these 
matrices are the key input parameters to Markowitz's classical portfolio optimization 
problem ||, which aims at providing a recipe for the composition of a portfolio of 
assets such that risk (quantified by the standard deviation of the portfolio's return) is 
minimized for a given level of expected return. For any practical use of the theory it 
would therefore be necessary to have reliable estimates for the volatilities and correla- 
tions of the returns on the assets making up the portfolio (i.e. for the elements of the 
covariance matrix), which are usually obtained from historical return series. However, 
the finite length T of the empirical time series inevitably leads to the appearance of 
noise (measurement error) in the covariance matrix estimates. It is clear that this noise 
becomes stronger and stronger with increasing portfolio size n, until at a certain n one 
overexploits the available information to such a degree that the positive definiteness of 
the covariance matrix (and with that the meaning of the whole exercise) is lost. 

This long known difficulty has been put into a new light by ^ [2|, §j where the 
problem has been approached from the point of view of random matrix theory. These 
studies have shown that empirical correlation matrices deduced from financial return 
series contain such a high amount of noise that, apart from a few large eigenvalues and 
the corresponding eigenvectors, their structure can essentially be regarded as random. 
In 0, e.g., it is reported that about 94% of the spectrum of correlation matrices de- 
termined from return series on the S&P 500 stocks can be fitted by that of a random 
matrix. One wonders how, under such circumstances, covariance matrices can be of 
any use in finance. Indeed, in p] the authors conclude that "Markowitz's portfolio op- 
timization scheme based on a purely historical determination of the correlation matrix 
is inadequate" . 

Two subsequent studies ||, ^| found that the risk level of optimized portfolios could 
be improved if prior to optimization one filtered out the lower part of the eigenvalue 
spectrum of the covariance matrix, thereby removing the noise (at least partially). In 
both of these studies, portfolios have been optimized by using the covariance matrix 
extracted from the first half of the available empirical sample, while risk was measured 
as the standard deviation of the return on these portfolios in the second half of the 
sample. @, || found a significant discrepancy between "predicted" risk (as given by the 
standard deviation of the optimal portfolio in the first half of the sample) and "realized" 
risk (given by its actual realization in the second half), although this discrepancy 
could be diminished by the use of the filtering technique. While these results suggest 
potential applications of random matrix theory, they also reinforce the doubts about 
the usefulness of empirical covariance matrices. 

On the other hand, Markowitz's theory is one of the pillars of present day finance. 
For example, the Capital Asset Pricing Model (CAPM), which plays a kind of bench- 
mark role in portfolio management, was inspired by Markowitz's approach; various 
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techniques of capital allocation are based on similar ideas. Furthermore, over the 
years, covariance matrices have found industry-wide applications also in risk man- 
agement. For example, RiskMetrics [0, which is perhaps the most widely accepted 
methodology for measuring market risk, uses covariance matrices as its fundamental 
inputs. The presence of such a high degree of noise in empirical covariance matrices 
as suggested by |T], 0, [| |], [J and the fact that these matrices are so widely utilized in 
the financial industry constitute an intriguing paradox. 

The motivation for our previous study || stemmed from this context. In addition 
to the noise due to the finite length of time series, real data always contain additional 
sources of error (non-stationarity, changes in the composition of the portfolio, in regu- 
lation, in fundamental market conditions, etc.). In order to get rid of these parasitic 
effects, we based our analysis on data artificially generated from some toy models. This 
procedure offers a major advantage in that the "true" parameters of the underlying 
stochastic process, hence also the statistics of the covariance matrix are exactly known. 
Furthermore, with a comparison to empirical data in mind, where the determination 
of expected returns becomes an additional source of uncertainty, we confined ourselves 
to the study of the minimal risk portfolio. Our main finding was that for parameter 
values typically encountered in practice the "true" risk of the minimum-risk portfolio 
determined in the presence of noise (i.e. based on the covariance matrix deduced from 
finite time series) is usually no more that 10-15% higher than that of the portfolio 
determined from the "true" covariance matrix. 

In the present work we continue and extend our previous analysis, but keep to the 
same toy-model-based approach as before. These models can be treated both numeri- 
cally and, in the limit when n and T go to infinity with r = n/T = fixed, analytically. 
Varying the ratio r = n/T we show that the difference between "predicted" and "real- 
ized" risk can, indeed, reach the high values found in 0, [5| when r is chosen as large 
as in those papers, but decreases significantly for smaller values of this ratio. This 
observation eliminates the apparent contradiction between 0, || and our earlier results 
||. Since in the simulation framework we know the exact process, not only its finite 
realizations, we can compare the "predicted" and "realized" risk to the "true" risk of 
the portfolio. We find that "realized" risk is a good proxy for "true" risk in all cases of 
practical importance and that "predicted" risk is always below, whereas "realized" risk 
is above the "true" risk. For asymptotically small values of n/T all the noise vanishes, 
but the value of T is, for evident reasons, limited in any practical application, therefore 
any bound one would like to impose on the effect of noise translates, in fact, into a 
constraint on the portfolio size n. 

Regarding one other aspect of the problem, we find that the effect of noise is very 
different depending on whether we wish to optimize the portfolio, or merely want to 
measure the risk of a given, fixed portfolio. While in the former case the effect of noise 
remains important up to relatively small values of n/T, in the latter case it becomes 
insignificant much sooner. This explains why covariance matrices could have remained 
a fundamental risk management tool even to date. 
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2 Results and Discussion 



We consider the following simplified version of the classical portfolio optimization prob- 
lem: the portfolio variance Yn,j=i w i a ij w j * s t° be minimized under the budget con- 
straint J2i=i w i = 1; where denotes the weight of asset i in the portfolio and 
represents the covariance matrix of returns. One could, of course, impose additional 
constraints (e.g. the usual one on the return), but this simplified form provides the 
most convenient laboratory to test the effect of noise, since it eliminates the uncertainty 
coming from the determination of expected returns. The solution to the optimization 
problem can then be found using the method of Lagrange multipliers, and after some 
trivial algebra one obtains for the weights of the optimal portfolio: 
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Z-tj,k=l a jk 

Starting with a given "noiseless " covariance matrix aff we generate "noisy " covari- 



ance matrices cr,-„- as 

1 T 

4 1 =7p^2yitVjt, (2) 
1 t=i 

where y it = J2]=i Xj t , with Xj t ~ i.i.d. N(0, 1) and Ljj the Cholesky decomposition of 
the matrix aff (a lower triangular matrix which satisfies J2k=i Lik Ljk = 0y , or L L T = 
a*- -*). In this way we obtain "return series" yu that have a distribution characterized 
by the "true" covariance matrix erj^ , while aff will correspond to the "empirical" 
covariance matrix. Of course, in the limit T — > oo the noise disappears and a^' —>■ af) . 
The main advantage of this simulation approach over empirical studies is that the 
"true" covariance matrix is exactly known. 

For our experiments we choose two simple forms for a\j . First, we perform our 
simulations with the simplest possible form for cr^, the identity matrix (Model I). 
In order to move a little closer to the observed structures, however, we also perform 
experiments with matrices cr« which have one eigenvalue chosen to be about 25 times 
larger than the rest and with the corresponding eigenvector (representing the "whole 
market") in the direction of (1, 1, ... , 1), while keeping the simplicity of the identity 
matrix in the other directions (Model II). This latter is meant to be a caricature of the 
covariance matrices deduced from financial return series (see 0, |3|). 

In order to see the effect of noise on the portfolio optimization problem we compare 
the square roots of the following quantities: 

1- ^H,j=i w i°^* a if w f^*i the "true" risk of the optimal portfolio without noise, where 
w^* denotes the solution to the optimization problem without noise; 

2- Z)ij=i w i^* a ij w j^* ^ the "true" risk of the optimal portfolio determined in the 
case of noise, where w\ denotes the solution to the optimization problem in the 
presence of noise; 
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3- 12ij=i Wi °jj w f^* > the "predicted" risk (cf. ||, ||), that is the risk that can be 
observed if the optimization is based on a return series of length T; 

4- £™ i= i ajf the "realized" risk (cf. |§ g]), that is the risk that would 

(2) 

be observed if the portfolio were held one more period of length T, where a\j is 
the covariance matrix calculated from the returns in the second period. 

To facilitate comparison, we calculate the ratios of the square roots of the three 
latter quantities to the first one, and denote these ratios by go, Qi and g2, respectively. 
That is qo,qi and g 2 represent the "true, "the "predicted" resp. the "realized" risk, 
expressed in units of the "true " risk in the absence of noise. For both model covariance 
matrices cr|° , we perform simulations for different values of the number of assets n and 
length of the time series T. For each given n and T, we generate several return series 
and covariance matrices, and each time we calculate the corresponding ratios g , q% and 
g 2 . Finally, we calculate the mean and standard deviation of these quantities for given 
n and T. 



Table 1: "True", "predicted", and "realized" risk for different values of the number of assets 
n and length of time series T (the figures in parentheses denote standard deviations). 



Mod. 


n 


T 


qo 


Qi 


92 


<?2 /qo 


q%l qi 


I 


100 


600 


1.09 (0.01) 


0.92 (0.03) 


1.09 (0.03) 


1.00 (0.03) 


1.19 (0.05) 


I 


500 


3000 


1.09 (0.01) 


0.91 (0.01) 


1.10 (0.02) 


1.00 (0.01) 


1.20 (0.03) 


I 


500 


1500 


1.22 (0.01) 


0.81 (0.01) 


1.22 (0.02) 


1.00 (0.01) 


1.49 (0.03) 


I 


500 


750 


1.73 (0.07) 


0.57 (0.02) 


1.74 (0.07) 


1.00 (0.02) 


3.00 (0.16) 


II 


500 


3000 


1.09 (0.01) 


0.91 (0.01) 


1.10 (0.01) 


1.00 (0.01) 


1.20 (0.02) 


II 


500 


750 


1.72 (0.06) 


0.58 (0.02) 


1.72 (0.08) 


1.00 (0.02) 


2.97 (0.17) 



The results of our simulations are given in Table |l|. It can be seen that go is 
higher than 1 for all values of n and T, as one would expect since the "optimal" 
portfolio obtained from the "noisy" covariance matrix must be less efficient than the 
one obtained from the "true" covariance matrix. It is also clear that g 2 is always very 
close to go, which suggests that "realized" risk can be used as a good proxy for the 
"true" risk when the "true" covariance matrix is not known. This is the very case in 
empirical studies, and therefore on the basis of our simulation results we can expect 
that the values obtained for "realized" risk for example in ||, || must indeed be close 
to the true risk figures. 

This, unfortunately, fails to be true for the predicted risk. As seen from the table, 
gi is always smaller than g (or g 2 ). Since q\ (actually the numerator of the fraction 
that determines gi) is the only risk figure that can be obtained in a framework when 
one sets up the optimal portfolio based on a finite sample of returns (i.e. when the true 
covariance matrix is unknown) and go is the "true" risk of the obtained portfolio, in 
such cases one will end up with underestimating risk. Therefore, optimization in the 
presence of noise will bias risk measurement and lead to the underestimation of the 
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risk of the optimal portfolio. These conclusions are in perfect qualitative agreement 
with those in 0, ||. The important point is the magnitude of the effect, however. It 
is obvious, and also born out by Table [IJ that the effect of noise should decrease with 
r = n/T, i.e. the risk measures go an d gi should converge to 1 as the length of the time 
series goes to infinity, with the size of the portfolio kept constant. 

For T, n — > oo and r = n/T = fixed one can, in fact, calculate go and gi analytically. 
Since the variance of a portfolio is a rotation invariant scalar, it can be evaluated in 
the principal axis system of the covariance matrix. In terms of the eigenvalue density 



pW = 2^ A (3) 



of the covariance matrix (see 0), where A± = (1 ± y/r) 2 , q can be written [T(J as 



* = J>(A)/AdA ■ < 4 » 



Simple integration yields go = 1/ — r. Similarly, for g x one can obtain g x = yjl — r. 
It is easy to verify that these asymptotic formulae fit the simulation results very well. 
Also note that, as we have argued in ||, when we have sufficient information about 
the portfolio, i.e. when n/T is small enough (such as for example in the first two 
rows in the table, or less), then go is not dramatically higher that 1, i.e. the inefficiency 
introduced by noise may not necessarily be very large. We have repeated our numerical 
experiments for Model II and obtained very similar results (see the last two rows of 
Table I) . 

In our view, the main message to be inferred from the above analysis is the bound 
it implies for the noise- induced error. According to the above formulae for g and gi, 
within the framework of our toy Model I the error in the risk estimate is about r/2 for 
small r (and we know that predicted risk is always smaller than the true risk!) As a 
result of inevitable additional imperfections (non-stationarity, deviation from normal 
statistics, etc.), the error in the risk estimate of real-life portfolios can only be larger 
than this, so r/2 can be viewed as a lower bound on the error. Conversely, if we set a 
value for the acceptable error, we have a bound on the ratio n/T . Since the length of 
meaningful time series is always limited (by changes in the composition of the portfolio, 
changes in the regulatory environment, in market conditions, etc.), this means that we 
have an upper bound on the size of the portfolio whose risk can be estimated with the 
pre-determined error. The filtering technique proposed in [|], [5] can then be regarded 
as a tool to break through this upper bound. 

Next, we compare our simulation results with results obtained for covariance ma- 
trices determined from empirical return series. To accomplish this, we use daily return 
series on 400 major US stocks during the period 1991-1996 (1200 observations for each 
stock). The data have been extracted from the the same dataset as in [|J|] Four 

1 We thank J. -P. Bouchaud and L. Laloux for making the dataset available for us. 
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non-overlapping samples of n = 100 stocks were created and divided into two periods 
of length T = 600. For each sample, the first period was used for optimization and 
the calculation of "predicted" risk, while the second period for determining "realized" 
risk. We found that the ratio of "predicted" and "realized" risk q^jqx was 1.36 ± 0.07, 
somewhat higher than in our simulations for the same n and T (1.19). To see whether 
this was caused simply by an increase in volatility in the second period or not, we re- 
peated our calculations with swapping the two periods in each sample, but we obtained 
similar results (1.44 ± 0.08). The additional bias in ^2/91 could be caused, however, 
by volatility dynamics (e.g. conditional heteroskedasticity). In order to diminish the 
effect of possible inhomogeneities in volatility we divided the sample such that every 
other value from the return series was allocated to the first "period" while the rest to 
the second. In this case we obtained 52/^1 = 1-22 ± 0.06, a result in line with our sim- 
ulations. In view of these latter findings, the results obtained e.g. in [|J can be easily 
understood. In the empirical example of 0] with n = 406 and T = 654, "realized" risk 
exceeded "predicted" risk by a factor of around 3, which seems reasonable if one takes 
into account that for this case the simulations, or the theory of the toy model, predict 
a value of 2.6. 

Finally, we studied the effect of noise on portfolios which were selected somehow 
independently from the covariance matrix data. For example, one could invest in equal 
proportions in each asset, or concentrate the portfolio to a few assets (e.g. stocks from 
one single sector). We analyze therefore the implications of noise on the risk mea- 
surement of portfolios whose weights are determined independently of the covariance 
matrix data. Let us consider a portfolio with weights Wi fixed (2£=i w i — !)■ in a 
simulation setup in which covariance matrices are generated as before, we compare the 
"true" risk of the portfolio (as measured by the square root of Z)ij=i w i a if w j) to the 
"observed" risk (deduced from J27j=i w i a if w i)i anc ^ ^ 9 denote the ratio of the sec- 
ond to the first. Our simulations show that for large enough time series length T, this 
quantity is very close to 1, no matter whether the number of assets n is small or large. 
For example, in the case in which the "true" covariance matrix is the n x n identity 
matrix and the weights Wi are chosen to be 1/n for all i, for T = 100 and n = 10, one 
obtains q = 1.00 ± 0.07, for T = 1000 and n = 10, q = 1.00 ± 0.02, while for T = 100 
and n = 100, q = 1.00 ± 0.07 (the mean is 1 and the standard deviation is of the order 
of 1/y/T). The results obtained for other choices of the covariance matrix af^ and 
of the weights Wi are similar, therefore we are led to the conclusion that the effect of 
randomness on the risk estimate of portfolios with fixed weights is very limited. 
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3 Conclusion 

In this paper we have studied the implications of noisy covariance matrices on portfolio 
optimization and risk management. The main motivation for this analysis was the 
apparent contradiction between results obtained on the basis of random matrix theory 
and the fact that covariance matrices are so widely utilized for investment or risk 
management purposes. Using a simulation-based approach we have shown that for 
parameter values typically encountered in practice the effect of noise on the risk of the 
optimal portfolio may not necessarily be as large as one might expect on the basis of 
the results of |I|, @, [|. The large discrepancy between "predicted" and "realized" risk 
obtained in f|, ||] can be explained by the low values of T/n used in these studies; 
for larger T/n the effect becomes much smaller. The analytic formulae derived in this 
paper provide a lower bound on the effect of noise, or, conversely, an upper bound on 
the size of the portfolios whose risk can be estimated with a prescribed error. Finally, 
we have shown that for portfolios with weights determined independently from the 
covariance matrix data, the effect of noise on the risk measurement process is quite 
small. 

A very interesting topic for further research would be to analyze the magnitude of 
these effects for portfolios constructed by techniques really used in practice, for example 
by an asset manager or hedge fund, since these methods often combine sophisticated 
optimization schemes with more subjective expert assessments. 
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